First Steps Towards an Annotated Database of American English

نویسندگان

  • Mitchell P. Marcus
  • Beatrice Santorini
  • David Magerman
چکیده

This paper reports on one of the first steps in building a very large annotated database of American English. We present and discuss the results of an experiment comparing manual part-of-speech tagging with manual verification and correction of automatic stochastic tagging. The experiment shows that correcting is superior to tagging with respect to speed, consistency and accuracy. Comments University of Pennsylvania Department of Computer and Information Science Technical Report No. MSCIS-90-46. This technical report is available at ScholarlyCommons: http://repository.upenn.edu/cis_reports/569 First Steps Towards An Annotated Database of American English MS-CIS-90-46 LINC LAB 175 Mitchell P. Marcus Beatrice Santorini David Magerman Department of Computer and Information Science School of Engineering and Applied Science University of Pennsylvania Philadelphia, PA 19 104

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attitude of Muslim Students towards English Idioms and Proverbs

This study aimed at investigating the attitude of Muslim students towards the use of certain English idioms and proverbs. Thirty Muslim students were asked to express their reactions and feelings towards two categories of English idioms and proverbs: the first category included idioms and proverbs containing the names of animals that are prohibited in Islam, and the second category contained cu...

متن کامل

Attitudes towards English as an International Language (EIL) in Iran: Development and Validation of a New Model and Questionnaire

This study aimed at developing and validating a new model and instrument to explore attitudes of Iranian EFL learners towards English as an International Language (EIL). In so doing, the researchers followed several rigorous steps including extensive literature review, content selection, item generation, designing the rating scales and personal information part, Delphi technique, item revision,...

متن کامل

Very Large Annotated Database of American English

Object ive To construct a data base (the "Penn Treebank') of written and transcribed spoken American English annotated with detailed grammatical structure. This data base will serve as a national resource, providing training material for a wide variety of approaches to automatic language acquisition, a rei~rence standard for the rigorous evaluation of some components of natural language underst...

متن کامل

Exploring Male and Female Iranian EFL Learners’ Attitude towards Native and Non-native Varieties of English

This study investigated whether Iranian EFL learners are aware of different varieties of English spoken throughout the world and whether they have tendency towards a particular variety of English. Likewise, it explored the attitudes of Iranian EFL learners towards the native and non-native varieties of English. Moreover, it made an attempt to investigate whether such attitudes are gender-orient...

متن کامل

An Investigation of Iranians and International English Students'Attitudes towards Intercultural Communicative Competence

The present study aimed to investigate the attitudes and perceived nature of thinking and understanding towards intercultural communicative competence (ICC) among International English major students. Accordingly, this study employed the paradigm of a sequential mixed-method research, in which it comprised a qualitative phase followed by a quantitative phase. The participants of the first phase...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015